Distinct patterns of simple sequence repeats and GC distribution in intragenic and intergenic regions of primate genomes
Abstract
As the first systematic examination of simple sequence repeats (SSRs) and guanine-cytosine (GC) distribution in the intragenic and intergenic regions of ten primates, our study showed that SSRs and GC content were nonrandomly distributed in both intragenic and intergenic regions, suggesting potential roles in transcriptional or translational regulation. Our results suggest that the majority of SSRs are located in non-coding regions, such as introns, TEs, and intergenic regions. In these primates, trinucleotide perfect (P) SSRs were the most abundant repeat type in the 5'UTRs and CDSs, whereas mononucleotide P-SSRs were the most abundant in the introns, 3'UTRs, TEs, and intergenic regions. GC content varied greatly among the intragenic and intergenic regions (5'UTRs > CDSs > 3'UTRs > TEs > introns > intergenic regions), and high GC content was frequently found in exon-rich regions. Within the same intragenic or intergenic region, the distribution of GC content was highly similar across the different primates. Tri- and hexanucleotide P-SSRs had the highest GC content in the 5'UTRs and CDSs, whereas mononucleotide P-SSRs had the lowest GC content in the six genomic regions of these primates. The most frequent motifs of each repeat length varied markedly among the genomic regions.
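To make the two quantities in the abstract concrete, the following is a minimal Python sketch of how perfect SSRs and GC content might be computed for a single region sequence. It is not the pipeline used in the study: the function names `gc_content` and `find_perfect_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen for illustration, since the abstract does not state its detection thresholds.

```python
import re

# Hypothetical minimum repeat counts per motif length (1-6 bp).
# These thresholds are assumptions for illustration, not values from the study.
MIN_REPEATS = {1: 12, 2: 7, 3: 5, 4: 4, 5: 4, 6: 4}


def gc_content(seq):
    """Return the fraction of G and C among the A/C/G/T bases of seq."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def _is_redundant(motif):
    """True if motif is itself a repeat of a shorter unit (e.g. 'AA', 'ATAT')."""
    n = len(motif)
    return any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, end, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least min_rep - 1 exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        pos = 0
        while True:
            match = pattern.search(seq, pos)
            if match is None:
                break
            motif = match.group(1)
            if _is_redundant(motif):
                # Already covered by a shorter unit; retry one base further on.
                pos = match.start() + 1
                continue
            count = (match.end() - match.start()) // unit_len
            hits.append((match.start(), match.end(), motif, count))
            pos = match.end()
    return sorted(hits)


# Toy sequence: a mononucleotide (A)12 run and a trinucleotide (CAG)5 run.
example = "GGC" + "A" * 12 + "TCT" + "CAG" * 5 + "TT"
for start, end, motif, count in find_perfect_ssrs(example):
    print(motif, count, round(gc_content(example[start:end]), 3))
```

Applied separately to sequences extracted for each region class (5'UTRs, CDSs, introns, 3'UTRs, TEs, intergenic regions), the same two functions would give per-region SSR counts by motif length and the GC content of each repeat, which are the kinds of comparisons the abstract summarizes.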
Related resources
Simple sequence repeats in organellar genomes of rice: frequency and distribution in genic and intergenic regions
Motivation: Simple sequence repeats (SSRs) are abundant across genomes. However, the significance of SSRs in organellar genomes of rice has not been completely understood. The availability of organellar genome sequences allows us to understand the organization of SSRs in their genic and intergenic regions. Results: We have analyzed SSRs in mitochondrial and chloroplast genomes of rice. We ident...
Microsatellites in different eukaryotic genomes: survey and analysis.
We examined the abundance of microsatellites with repeated unit lengths of 1-6 base pairs in several eukaryotic taxonomic groups: primates, rodents, other mammals, nonmammalian vertebrates, arthropods, Caenorhabditis elegans, plants, yeast, and other fungi. Distribution of simple sequence repeats was compared between exons, introns, and intergenic regions. Tri- and hexanucleotide repeats prevai...
The Complete Chloroplast Genome of Ye-Xing-Ba (Scrophularia dentata; Scrophulariaceae), an Alpine Tibetan Herb
Scrophularia dentata is an important Tibetan medicinal plant and traditionally used for the treatment of exanthema and fever in Traditional Tibetan Medicine (TTM). However, there is little sequence and genomic information available for S. dentata. In this paper, we report the complete chloroplast genome sequence of S. dentata and it is the first sequenced member of the Sect. Tomiophyllum within...
Chloroplast Genome Variation in Upland and Lowland Switchgrass
Switchgrass (Panicum virgatum L.) exists at multiple ploidies and two phenotypically distinct ecotypes. To facilitate interploidal comparisons and to understand the extent of sequence variation within existing breeding pools, two complete switchgrass chloroplast genomes were sequenced from individuals representative of the upland and lowland ecotypes. The results demonstrated a very high degree...
SSR repeat dynamics in mitochondrial genomes of five domestic animal species
Simple sequence repeats (SSRs) are ubiquitous and abundant in genomes. In the mitochondrial genomes of animals, their distribution, size dynamics and usefulness for inferring phylogenetic relationships have not been understood. The present investigation reveals the organisation of SSRs in genic and intergenic regions, their length and repeat motif dynamics, the extent of conservation of flanking regions, appropriaten...